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1. Introduction. 



In all the fire distribution problems that we have considered 
previously [14]-[19] an implicit assumption has always been that fire could 
be instantaneously shifted from one target type to another. To illustrate, 
let us recall a typical problem: 



maximize {ry(T) - px (T) - qx (T)}, 
♦ <t) 1 Z 

dx l 

subject to: = - <{>a^y, 

dx 2 

— = -d^)a 2 y, 



df ' ‘ Vl ‘ b 2 x 2’ 



( 1 ) 



x i ,x 2*y ^ ^ an< ^ 0 ^ ^ 1, 



and with initial conditions 



x 1 (t=0) = x°, x 2 (t=0) - x°, y(t*0) = y Q « 



In this problem <j> (the fraction of the Y-forces which fires at X^) 
is the control (decision or policy) variable. The reader should note that 
although the control must satisfy the condition 0 £ <j> £ 1 , the rate of 
change of <j> is unrestricted so that <J) can instantaneously change, for 
example, from 0 to 1. Physically, this means that we are assuming 
that the Y-forces can instantaneously shift fires as desired. 

When one considers command and control problems in combat, the 
above implicit assumption on 4> (instantaneous jumps permitted) does not 
seem to be a realistic one. A better assumption appears to be that 
there is a limit to how fast <j> can be changed. For example, consider a 
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homogeneous Y-force in combat against heterogeneous enemy forces, A 
command and control system directs the fire of the homogeneous Y-force 
against particular enemy targets. The effectiveness of the command and 
control system might be measured in terms of the speed and accuracy with 
which units of the Y-force react to orders as to which type of enemy unit 
at which to fire (see [13] for some similar ideas). This fire distribution 
process may be described in terms of a distribution of fire variable <j> . 

We have thus been led to consider target selection problems in which the 
rate of change of the allocation variable is bounded (i.e. instantaneous 
shifts in fire are not allowed) . For reasons discussed below, we have 
chosen to call such a situation "inertial combat." 

Although problems in which curves are restricted to lie in a given 
domain were considered in the classical calculus of variations as long ago 
as 1831 [5] (see also [1]) and discussed by Weierstrass in his lectures of 
1879 (see p. 395 of [1]), development of optimality conditions for optimal 
control problems with state variable inequality constraints has been 
accomplished only comparatively recently. As the author pointed out in 
[19], state variable inequality constraints (SVIC's) are present in all 
Lanchester-type optimal control/differential game problems. Recent activity 
in developing necessary conditions of optimality for problems with SVIC ’ s 
apparently owes its origin to the work of Gamkrelidze (for an English 
translation of his original work see Chapter VI of [11]). Gamkrelidze 
points out that in many physical problems there are restrictions not only 
on the control parameters but also on the state (phase) space. He (see 
p. 263 of [11]) refers to piecewise continuous controls as "inertialess 
controls," since such controls can, if need by, instantaneously jump from 
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one value to another. Following Gamkrelidze then, we use the term inertial 
combat to refer to a target selection problem in which the rate of change 
of the distribution of fire is bounded. 

In [17] the author first applied the theory of SVIC f s (the approach 

of Gamkrelidze [11]) to an allocation problem in the Lanchester theory of 

combat. In this first application little more than developing optimality 

conditions on constrained subarcs was done. In [19] we introduced a more 

convenient approach to first order SVIC f s (the approach of Speyer (see [6])) 

and used theory (including corner conditions and boundary conditions for 

the adjoint variables (see also [21])) to completely solve a problem 

similar to (1). The paper at hand further extends such results: we con- 

* 

sider a problem with a second order SVIC (as well as first order SVIC s) . 
This application is possible because of theoretical results recently obtained 
by the author who extended Gamkrelidze T s multiplier condition [11] (see also 
[20]) to a p — order SVIC [22] (see also [21]). The reader can find a 
further discussion of the theory of SVIC 1 s in [17] and [19] (see also [6] 
and [10 ] ) . 

This paper is organized in the following fashion. First, we discuss 
the optimal control problem (optimal fire distribution in the presence of 
command and control limitations). Then, the basic necessary conditions 
of optimality are developed for the problem. Next, the synthesis of the 
extremal fire distribution policy is outlined in several cases. The 
determination of the optimal fire distribution policy is discussed, and 

This terminology was apparently first coined by Bryson, Denham, and Dreyfus 
[3]. They say that a problem has a pJi!l order SVIC when the pJill time 
(total) derivative of the state-variable constraint is the first derivative 
to explicitly contain the control variables. 
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a more general model considered. Finally, we discuss the insights gained 
into the optimization of combat dynamics from our study of the problem at 
hand . 



2 . The Fire Distribution Problem in the Presence of Command and Control 
Limitations . 

Accordingly, we consider the following problem: 



maximize (ry(T) - px^(T) - qx^(T)} with specified, 



u(t) 



dx i 

subject to: 

dx 2 

d r = - (1 -* )a 2 y> 

■ - Vl - b 2 X 2’ 

dd> 

dt = U > 



( 2 ) 



,x 2 ,y ^ 0, T^T^, 0 £ <f> £ 1, and - s: u s: R^ : 



and with initial conditions 



^(t-0) = x°, x 2 (t=0) = x°, y(t=0) = y Q , 4>(t=0) = <t> Q , 



where all symbols are defined in the next section and >0* It 

should be noted in the above model it is no longer possible for <J> to 
instantaneously change from, for example, 0 to 1 as it had been for 
(1) . As we discussed in the introduction, this is how we incorporate 
command and control limitations into our model. 

We will focus primarily upon the development of the basic necessary 
conditions of optimality for (2) and the synthesis of extremal control from 



5 



these. As discussed more thoroughly in Section 8, it is not practical 
for computational reasons to completely carry out the determination of 
the optimal control. Thus, for this more limited goal of characterizing 
the optimal fire distribution policy the nature of the planning horizon 
(terminal target set or prescribed duration) doesn T t make any difference, 
and we will be purposely vague on this point. 

The reader should note that the control variable in problem (2) is 
u, while 0 (the control variable in problem (1)) is now a state variable . 
Hence, the restriction 0 £ 0 g 1 is now equivalent to two (first order) 
SVIC f s. When we use the approach of Gamkrelidze (see Chapter VI of [11]) 

(as modified by Bryson et al . [3] (see also [6])), a SVIC such as 
C(t,x_^(t)) £ 0 is replaced by the point constraint 



C ( t * x . ) = 0, 
entry l 



and the control inequality constraint on the state boundary (C=0) 



dC 

~ 7 ~~ ( t , x . , u) £ 0 for t 6 [t ,t . 1. 

dt l entry 5 exit 



Thus, for <j> - 1 £ 0 , we treat boundary arcs when 0 = 1 by considering 

0(t ) = 1 and then requiring 

T entry 

4^ = u £ 0 for t € [t ,t . ] when 0-1 = 0, (3) 

dt entry 5 exit J y 

and for -0^0, we treat boundary arcs when 0=0 by considering 



0 ( t ) = 0 and then requiring 

entry 

- 4^ = - u :£ 0 for t 6 [t ,t . ] when -0 = 0. (4) 

dt entry exit 



However, see [ 14 ] — [ 18 ] for the type of considerations (i.e. enumeration 
of all possible terminal states) required for developing a complete solution 
to such a problem. 
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To avoid being encumbered by too many symbols, we will consider 
only one of the two SVIC ! s x j _> x 2 ^ 0* Clearly, we lose no generality in 
considering ^ 0. In this case, we have 



dC 

dt 



C(t,x i ) 

(t.x^ 




dt 



* 0 , 

= ^y, 




(t.x^u) = ua^ - <j>a 1 (b 1 x 1 +b 2 x 2 ) . 



(5) 

( 6 ) 

(7) 



On a constrainted subarc on which x, (t) =0 for t . i t i t . , 

l v ' entry exit’ 

the SVIC is replaced by the point constraints 



and 



C(t ,x . ) = - x. (t ) = 0, 
entry’ 1 1 entry 



dC , , 

dt entry’ X i 



dx l 

dt ^entry^ ^ ^ t entry^ a l y 



( 8 ) 

(9) 



and the control inequality constraint 



d 2 C 

dt 7 



(t,x i> u) = ua^y - <j>a ;L (b 1 x 1 +b 2 x 2 ) £ 0 for t € 



[t „ ,t .1. (10) 

entry exit 



Thus, x^ £ 0 is a second order SVIC. Clearly, when x^(t) = 0 for a 
finite interval of time, by (6) we must have 4>*(t) = 0 (since y > 0) 
and then (7) yields u*(t) = 0, where we have considered the state 
equations (2) . 



3. Notation. 



The symbols which are used in this paper are defined as follows: 



a^,a 2> b^,b 2 = constant attrition-rate coefficients, 



(C)(t ) = ^ 



t=t 
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F(<(>,u) = rate of change of fraction of Y-fire directed at in more 

general redistribution of fire model, 



H = Hamiltonian function, 



J = criterion (return) functional = ry(T) - px^(T) - qx^CT) , 

p,q,r = utilities assigned to surviving , X^ and Y forces 
respectively , 

p (t) for i = 1,2, 3, 4 = dual variable corresponding to x.(t) 

(x 3 (t) = y(t), x^(t) = <Kt)), 1 

R ,R = lower and upper bounds on magnitude of rate of change of 
U (i.e. £ u <: R u ) , 

t = time after beginning of battle, 



t = time of entry to constrained subarc, 
e J 



t- = lim t, 
6 tr*t 

e 

t£t 



t° = time of entry to constrained subarc with <f>(t) =0 for t° £ t £ t° 
6 (similarly for t*) , 

t 0 = time of leaving constrained subarc, 



t - time at which u*(t) switches from to -R^ with 0 < <j> < 1 ; 



defined by p^(t=t g ) = 0, 



t^ = time at which X^ is annihilated, i.e. x^(t^) = 



T = time at which battle ends, 



= maximum possible duration for battle, i.e. T £ , 

u = control variable for redistribution of fire, 



V = v ( t ) = a 1 (-p 1 (t)) - a 2 (-p 2 (t)), 
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W = Bellman's optimal value function, 

i 

x ,x ,y = combatant force levels; with initial values 
6 , 61,62 = positive constants, 

n 1 (t) (n 2 ( t )) = multiplier corresponding to state variable inequality 
constraint <j> £ 1 (<J>^0) , 



y(t) = multiplier corresponding to state variable inequality constraint 

x 1 ^ 0, 

Vq = multiplier corresponding to intermediate equality constraint x^(t^) 
used to (formally) handle entry to a constrained subarc with 
x x (t) = 0. 

v. for i = 1,2,3 = multiplier corresponding to state variable terminal 

inequality constraint x^T) ^ 0 (x^(T) = y(T)), 

v (v ) = multiplier corresponding to state variable terminal inequality 
constraint <(>(T) £ 1 (<j> (T) ^ 0), 

= fraction of Y-fire directed at X^, 

T = "backwards time" from the end of the battle; defined by x = T - t, 
i.e. the time remaining before the end of battle, 



x^ = "backwards time" of the first change in the sign of the switching 
function v, i.e. v(t=T~x^) = 0. 

4 . Characterization of an Optimal Fire Distribution Policy . 

Using Garakrelidze's approach and considering (3), (4), ( 8 ), (9), 
and (10), we have that the Hamiltonian is given by [2], [ 6 ], [11] 



H(t,x i> p i ,u) = -p^a^y - p 2 (l-<J>)a 2 y - P 3 ( b 1 x 1 +b 2 x 2 ) + P 4 U 

- n 1 (t)u + n 2 (t)u - u(t){ua 1 y- 4 ia 1 (b 1 x 1 +b 2 x 2 )} , ( 11 ) 



where 
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{ 



and 



= 0 


for 


<P < 1 > 






n 2 (t) { 


= 0 


for 


* > o, 


^ 0 


for 


♦ - 1, 






* ( 


^ 0 


for 


o 

it 






/ 


= 0 


for 


x > 0, 












y(t) { 






1 












{ 


^ 0 


for 


X 

It 

O 









We have adopted above the following correspondence between state and 
dual variables: 

state variable dual variable 



y 

* 



v 

v 

V 



Again, to avoid being encumbered with too many symbols, we have only 
considered one (i.e. ^ 0) of the two SVIC's x ^ ,x 2 ^ ot ^ er 

SVIC (i.e. ^ 0) is handled in a similar way. The adjoint system of 

differential equations for the dual variables is 



dp l . 


3H 


dt 


- ax 3 


dp 2 


3H 


dt 


3x 2 


dp 3 


_ 9H 


dt 


3y 


dp 4 


9H 


dt 


d<p 



* t’ 1 p 3 - y(t)())a 1 b 1 . 



= b 2 P 3 - y(t)<{»a 1 b 2 . 



* 1*1 



2*2 



= (a 1 p 1 ~a 2 p 2 )y - p(t)a 1 (b 1 x ;L +b 2 x 2 ) 



( 12 ) 



(13) 



(14) 



(15) 



The boundary conditions at t = T for the adjoint (or dual) variables 
may be written 

P 1 (t-T) = -p + v x , p 2 (t=T) = -q + v 2 , p 3 (t-T) = r + V 3 , p 4 (t=T) = " v. , 
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where for i = 1,...,5 are undetermined multipliers which require 

the specification of additional information to be further delineated. To 
* 

this end, let us consider the case in which Y loses with T < (i.e. 

T defined by y(T) = 0). From the transversality condition H(T,x^(T) , 
p^(T) ,u* (T) ) = 0, we obtain p ^ (T) = 0. Then, we have [2], [21] 

p^T) = -p + v x> P 2 (T) = -q + v 2> P 3 (T) = 0, p 4 (T) = v 4 - v 5> (16) 

where 

! - 0 for x^T) > 0, 

^ 0 for x^(T) = 0 but x^(t) > 0 for t < T, 

unrestricted when x.(t) = 0 for t. ^ t ^ T 

l l 

with t. < T, 

l 

and 



= 0 for <f>(T) 


* 

o 

A 


t - 0 


for 


4>(T) < 1, 


4 * 2 : 0 for <f> (T) 


= 0, 


Vs< Lo 


for 


♦(T) = 1. 


When x^yX^yy > 0 


and 0 < <J> < 1 , 


the (extremal) 


control law is 



determined by the maximum principle. Hence, we consider 



maximize H(t,x ,p ,u) , 

- r l** r u 



and this yields 



R for p,(t) > 0, 

u*(t) = { 

‘ -R^ for P 4 (t) < 0. 



(17) 



* 

Other cases are handled in a similar manner. See [19] for a problem in 
which the boundary conditions for the adjoint variables are worked out for 
all the battle* s end states. 
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We must further investigate the possibility of singular subarcs [7] (also 

3H 

see Chapter 8 in [2]) on which -r— = 0 for a finite interval of time (so 

a U 

3H 

that all its time derivatives vanish) • The condition that — = 0 yields 

3 u 

that on a singular subarc we must have 



P 4 (t) = 0. 



d 3H 

The condition — (— ) = 0 then yields 



3Ha 

3u^ 



(18) 



a i P l (t) = a 2 P 2 (t) * 



(19) 



Proceeding to the next time derivative, we would have on a singular subarc 
on which (18) and (19) hold that 




( 20 ) 



From (20), we see that a singular solution is impossible, since it is 
impossible (in general) to have -j— 77 - = 0 for a finite interval of 

time. 



4.1. Necessary Conditions of Optimality on Constrained Subarc for j>_ . 

On a constrained subarc on which d>(t) - 1 for t £ t £ t n the 

e x 

dd> 

control is determined by — = 0 and hence 

dt 



u*(t) = 0 for tj, < t < t 2 - 



( 21 ) 



3H 

The multiplier n n (t) is determined by the condition — = 0 and hence 

1 a U 



n l (t) = P 4 ( ' t) • 



( 22 ) 



The requirement that n^(t) ^0 yields that on the constrained subarc we 



must have 
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P 4 (t) * 0. (23) 

Differentiating (23) and combining with (15) , we obtain 

n x (t) = y(a 1 p 1 ~a 2 p 2 ) , (24) 

♦ 

so that Gamkrelidze 1 s condition rj^(t) ^ 0 [11] (see also [20]) is only 

satisfied on a constrained subarc with <}> = 1 when 

^(-PlCt)) ^ a 2 (~P ( t) ) > (25) 

which the reader will, of course, recognize as a result for the correspond- 
ing M inertialess !l combat problem (see [14], [19]). Denoting the time of 
an entrance corner by t^ and that of an exit corner by t^, the corner 
conditions (see [17] and [19] for further discussion, especially for 
corners interior to the state space (which are not explicitly discussed 
here)) yield that at an entrance corner we have [10] 

Pi(t e ) = P i (t^) for i = 1,2,3, (26) 

and 

P 4 ( V = ° = P 4 (t t } ‘ n i (t t } > (27) 

or 

p 4 (t t } = (28) 

where t denotes a left-hand limit. The reader should note that (28) 
e 

is in consonance with (22). Furthermore, at an exit corner we have 

P i (t J^ = P i (t l^ for 1 = 1 > 2 ’ 3 ’ 4 * 

Considering either n-^t^) =0 or lUt^) = H(t+) , we find that 



( 29 ) 
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P 4 (t l> ‘ 0 = P4 (t t>' 



(30) 



Considering (24), (28), and again n^(t^) = 0, we see that when there 
is an exit from the constrained subarc at t ° t^ , then (28) yields 



, -K , 

p. (t ) = n, (t ) = 



y(t){a 2 p 2 (t) - a 1 p 1 (t)}dt £ 0. 



(31) 



t 

e 

On a constrained subarc on which $(t) =0 for £ t £ the 
details are similar to the above with the control again given by (21) , 
since again = 0, Determinations similar to the above yield 



n 2 (t) = -P 4 (t) , 



(32) 



and that it is necessary on the constrained subarc that 



P 4 (t) £ 0, (33) 

and 

^1 (“ p l ( t) ) £ a 2 (-p 2 (t)) , (34) 

the latter condition (34) being a consequence of Gamkrelidze 1 s multiplier 
condition ^(t) ^ 0* Corner conditions similar to (26) through (30) also 
must hold. When there is an exit from the constrained subarc at t = t. , 
we find that 



, , +v 

P 4 (t e ) - -n 2 (t e ) - 



y(t){a 2 P 2 (t) - a 1 P 1 (t)}dt ^ 0. 



(35) 
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4.2. Necessary Conditions of Optimality on Constrained Subarc for x ^ « 

On a constrained subarc on which x_ (t) « 0 for t £ t £ T the 

,2 1 e 

control is determined by = 0 which (along with the requirement that 

dx^ 

dt 



= 0) yields 



u*(t) = 0 for < t s T, 



(36) 



and 



<j>(t) = 0 for £ t £ T. 



(37) 



3H 



The multiplier y(t) is determined by the condition t— = 0 and hence 

OU 



P 4 (t) = u(t)a 1 y = 0, 



(38) 



or 



y(t) = 



P 4 (t) 

a i y 



(39) 



Differentiating (38) with respect to time and combining with (15) , we 
* 

obtain 



u<t > -^ c vrV2>- 



(40) 



Further differentiation and combination with (12) , (13) , and the condition 
<j>(t) = 0 yields that 

Po(t) 

y(t) = — ( a 1 b i" a 2 b 2 ) . (41) 



The necessary condition of optimality on a constrained subarc of a second 

k d k 

order SVIC [22] is that (-1) — £ ^ 0 for k = 0,1,2, and hence consid- 

dt 

ering (39) , (40) , and (41) we must have 



This result was obtained after cancellation of a term y(t) . Hence, a 
different argument is required when y(t) = 0. This latter condition, 
however, only occurs at most at a single isolated point t = T. See [19] 
for a similar occurrence and further discussion. 
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P 4 (t) * 0, (42) 

a l(- p l(t)) ^ a 2 (-p 2 ( t) ) , (43) 

and a ^ a 2 h 2’ (44) 



since it is readily shown that p^(t) > 0 for t < T. It is of interest 
to note that (43) and (44) have previously been shown [17] to be necessary 
conditions of optimality for having x^ = 0 for a finite interval of time 
in the inertialless combat problem (1) . 

As in [14], [19] let us make the nonrestrictive assumption that 
a^b^ > a 2 b 2‘ This then implies that it is non-optimal to have = 0 
for a finite interval of time, since (44) must hold with the sense of the 
inequality reversed on such a constrained subarc. 

For a second order SVIC C(t,x^) £ 0, we must have at an entrance 
corner to a constrained subarc [10] 



p i (t e) - Pi<0 - v o It: 



(t ) - U(0 
e e 



3x, 



{ (C) (t )} 
e 



for i =• 1, 



(45) 



and 



H(tj = H(t+) + v |£ (t ) + U(tb 7T { (C) (t ) } , 
e e Odte edt e 



(46) 



where [6] ^ 0. Recalling (5) and (6), we find for the problem at hand 

at an entrance at t = t to a constrained subarc on which x.. = 0 that 

e 1 

the following hold 



p i« : e) = P i (tg) for i = 2,3, 

p 4 (t e } = P4 (t t ) ‘ u(c t )a l y - 



(47) 

(48) 



( 49 ) 
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From (38) we have 

P 4 ( 0 = y(t+)a iy> (50) 

so that (49) yields 

P 4 (tT) = 0. (51) 

Furthermore, the corner condition (46) (which reads H(t^) ~ H(t*)) is 
satisfied when (48) and (49) hold. It should be clear that no trajectory 
can leave a constrained subarc on which x^ = 0 after entry. Hence, we 
omit discussion of the corner conditions at exit corners. 

It will now be shown that we must have \>q > 0. Considering the 
nonrestrictive assumption a^b^ > a 2^2* ^ s ^ lou -'- c ^ be c ^ ear that we must 

have > 0 for 0 < t < t^ (we might have <j>(t-0) = 0). Then 

<}>(t) > 0 for 0 < t < t and ^ ( t) « 0 for t £ t with (j)(t) being 
continuous imply that u*(t e ) = -R^, and thus by (17) and (51) 

P 4 (t) < 0 for t 6 (t e -6,t e ) with p^(t^) » 0, (52) 

where 5 > 0 is a suitably chosen constant, and we have used the fact 
that it is impossible for p^(t) = 0 for a finite interval of time (this 
was established when we showed the impossibility of a singular solution) . 
Expanding p^(t) for t < t^ in a Taylor series about t = t^, we find 
that 

d P / 

(t e -t) (t) = (-p 4 (t)) > 0, (53) 

~ dp 4 

where t* £ (t,t ). Hence, (t) > 0 for all t 6 (t -6- ,t ) where 

e at e 1 e 

0 < 6^ £ <5. Recalling (15) and that y > 0, we have 

a lPl(t) > a 2 p 2 (t) for t € (t e ~6 1 ,t e ) , 



(54) 
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so that in the limit we have 






( 55 ) 



Next, we show that we must have 



WV =• a 2 p 2 (t e ) - 



(56) 



The proof is by contradiction. Considering (55), we assume that a n p_(t ) 

11 e 



= a o p 0 (t ). Again expand p, (t) for t < t in a Taylor series 
Lie 4 e 



about 



t = t . Recalling (15) and using the above assumption and (51) , we find 
that for t < t 



(t e -t ) 2 d 2 p 4 _ 

' p 4^ = 2 dt 2 ^ ’ 



(57) 



where t € (t,t ). Using the state and adjoint equations, we readily 
compute that 
d 2 P 4 

■jp r - 1 (O = yp 3 (t) (a 1 b 1 -a 2 b 2 ) - (a^ (t) -a^ ( t) } (b^+b^) . (58) 

By the continuity of the dual variables between corners, t can be chosen 



€(t -fi,t ) such that for all t € (t,t ) we have 
e e e 



d 2 p. 



dt 



(?) > 0 , 



and hence by (57) we have a contradiction to (52). Thus, (56) must hold. 
Next, we show that we must have 



a l P l (t e } " a 2 P 2 ( 0- 



(59) 



This follows immediately from p^(t) > 0 for t < T =* ii(t) > 0 for 

t £ t < T. Then y(t=T) £ 0 =* y(t) < 0 for t < t < T, 
e e 

to (40) yields the desired result (59) . 



and reference 
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The proof that > 0 now readily follows. First, we observe 

that (48) , (56) , and (59) yield 

a l P l (t e ) * a 2 P 2 (t ? = a 2 P 2 (t e 5 > a l p l ( 0’ (60) 

or 

p^t") > ( 6i ) 

whence follows > 0 by (47) . Moreover, is chosen so that <j> = 0 

* 

precisely when x^(t^) = 

5 • Synthesis of Extremal Policy when x^(T) ,x 0 (T) > 0 . 

In this and the next two sections we synthesize the extremal fire 

** 

distribution policy for all cases in which Y loses with T < (the 

same case for which the boundary conditions for the adjoint variables were 
given in Section 4) . By the synthesis of the extremal control we mean the 
explicit determination (using the necessary conditions of optimality) of 
the time history of the extremal control from initial to terminal time 
(see [ 17 ] — [ 19 ] for further, more extensive discussion). 

The basic idea is to trace extremals backwards from a given 
terminal state in such a way as to guarantee the satisfaction of the initial 



* 

This multiplier Vq arises because the system loses two degrees of freedom 
when it enters the constrained subarc (see pp. 411-412 of [10]). For a first 
order SVIC, the value of the multiplier y(t£) at the entrance corner accounts 
for the loss of one degree of freedom by the system upon entering the con- 
strained subarc. For a second order SVIC, there are two degrees of freedom 
lost this way (for the problem at hand, x^ = 0 and <j> = 0 on the constrained 
subarc), one of which is accounted for by y(t^). 

Other cases are handled in a similar manner. See [18] and [19] for problems 
in which this is done for every end state of battle. 

By an extremal we mean a path (or trajectory) on which the necessary condi- 
tions of optimality are satisfied at every point in time. 
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conditions. Thus, it is convenient to introduce the ’’backwards time” 

ariable 
d 2 d 2 



variable x defined by x = T - t. We observe that - = — r~ but 

dt dx 



dt z dx 4 



It is also convenient to define 



v(t) = a 1 (-p 1 (t)) - a 2 (-p 2 (t)). 



(62) 



so that differentiation and combination with (12) and (13) yield 



dt P 3^ t ' ) ^ a l b l a 2 b 2' ) ’ 



(63) 



Then our nonrestrictive assumption that a^b^ > yields that 



dv 

dt 



(t) < 0 for all t < T, 



(64) 



since it is easily shown that p^(t) > 0 for t < T. 
convenient to write (15) (for > 0) as 



Using (62) , it is 



dp^ 

dT 



= " yv. 



and hence 



d P 4 . s dv 

dP" ‘ (b l x l +b 2 x 2 )v - y dT 



(65) 



(66 J 



In synthesizing an extremal there are two cases to consider: 



Case (a) a^p ^ a^q, 

Case (b) a^p < 

For Case (a) : a^p ^ a 2 ^ > -*- s conven i enC to first observe that 

a Taylor series expansion of p^(x) about x = 0 yields for x ^ 0 

dp, 2 d2 P' 

p 4 (t) = P4^ t=0 ^ + T ^7“ ( T= °) + 'Y dx^ (67) 

where i £ (0,x). In this case we have 
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v(t= 0) = a 1 p - a 2 q ;> 0, (68) 

so that considering (64) it is readily seen that 

v(t) > 0 for t > 0, (69) 

and hence (65) and (66) yield 

dp 4 

-j— (t= 0) = y(T) (a 1 p-a 2 q) = 0, (70) 

since y(T) = 0 and 

d 2 P 4 

’ (O > 0 for T > 0. (71) 

Furthermore, there are three subcases to be considered when 
a i P ^ a 2 q: 

Subcase (al) $ (t=T) = 0, 

Subcase (a2) 0 < <j>(t=T) < 1, 

Subcase (a3) 4> ( t=T) = 1. 

We will now show that Subcase (al) is not consistent with an optimal policy 
and work out details for the other two cases. 

Subcase (al) : <Kt=T) = Q when a ^ p ^ a ^q . 

Since 4> ( t=T) = 0, (16) yields that p^(r=0) = ^ 0 . Then (67), 

(70) , and (71) yield that 

p^(x) > 0 for t > 0, (72) 

If the system would be on a constrained subarc for a finite interval of 

time, i.e. 4>(t) = 0 for t £ t £ T, then (72) is a violation of the 

e 

necessary condition of optimality (33), (We also note that Gamkrelidze 1 s 
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necessary condition r\ ^ (t) ^ 0 is violated, since by (32) , (65) and (69) 

we have n 0 (t) * yv(t) > 0 for t £ t < T.) If we were not on a con- 
l e 

strained subarc for a finite interval of time, then we must have 4 >(t) > 0 
for 0 < x < 6 with <}>(t= 0) = 0. This implies that u*(x) = -R^ for 
0 £ x < 6^ ^ 6, and this is impossible by (17) and (72). Hence this 
case is inconsistent with an optimal policy. 

Subcase (a2) : 0 < <ft(t-T) < 1 when a ^ p ^ a ^q . 

Since 0 < <J>(t=T) < 1, (16) yields p^(x=0) = 0, and (72) again 

follows- Then by (17) and (72) we have 



u*(t) = R^ for 0 £ x ^ T. 



(73) 



Denoting <J>(t=0) by we then have + R^T = <f>(t=T) <1, so that 

this case happens when T < (1 -<|>q)/R^. The extremal policy is then given 
by 



u*(t) = R u for 0 <: t <: T < (l-^/R^ 



(74) 



For longer times we must go to the next subcase. 

Subcase (a3) : 4> ( t=T) = 1 when a p ^ a ^q . 

Since 4>(t=T) = 1, (16) now yields p^(x=0) = ^ 0. It may be 

shown that a contradiction arises unless = 0. Hence, we must have 

p^(x*0) = 0. If <J>(t) < 1 for T - 6 £ t < T where 6 > 0, then the 
development of the previous subcase holds. If we are on a constrained 

subarc with 4>(t) = 1 fox t^ £ t £ T, then by (24) and (62) we have 

• • 

H 1 (t) = -yv(t) . Hence Gamkrelidze T s necessary condition n 7 (t) £ 0 is 

satisfied by (69). Thus, we can remain (in backwards progression at the 
end t = T) on the constrained subarc until we have to get off to meet 
the initial condition 4>(t=0) = As we work backwards and leave the 
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constrained subarc (but in forwards time enter at t ) , the corner 

e 

condition (27) and a Taylor series expansion of p^(t) about t « t^ 
yield for 0 £ t £ t 



dp 4 (t £ -t ) 2 d 2 p 4 

P 4 (t) = dt“ (t=t e } + 2 dT 2- (t=t) ’ 



(75) 



where tT £ (t,t ). Recalling (65), (69), and (71), we see that p^(t) £ 0 
for 0 £ t £ t so that the rest of the analysis is similar to that of 
the preceding subcase. Hence 



when T ^ (1 -<J>q) /R^, we have 

for 0 £ t £ (l-(j) Q )/R u , u*(t) = Ry and <f)*(t) = <j> 0 + R^t, 

for (1 -$q)/R^ < t T, u*(t) = 0 and $*(t) = 1. (76) 



For Case (b) ; a i p < a 2 q> we ^ ave t ^ iat (recalling (62)) 

v(x=0) = a^p - a^q < 0. (77) 

From (64) we have ( T ) > 0 for t > 0 with v(x) a continuous 

dx 

function (see [2] and above corner conditions at boundary of state space 
(26)), and thus at some (backwards) time v(x) must become zero. Denote 
this "backwards time" as x^. Thus v(x=x^) = 0. There are, again, 
three subcases to be considered when a^p < 

Subcase (bl) $(t=T) = 

Subcase (b2) 0 < <j>(t=T) < 1, 

Subcase (b3) ( t =T) = 1. 

Analysis of these subcases is similar to that given for Case (a) with 
Subcase (b3) being impossible. 
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Let us now observe (recalling (25) and (34)) that in order to 
satisfy Gamkrelidze 1 s condition on constrained subarcs with $ = 0 or 
~ 1 , we must have 

v(t) ^ 0 when (j>(t) = 1 for a finite interval of time, 
and v(t) 0 when <{> ( t ) = 0 for a finite interval of time. 



(78) 

(79) 



We have previously noted in Section 4.1 the correspondence of these results 

to those for "inert ialless" combat. We now consider the case when 

4>(t=T) = 0. Let t° denote the (forward) time when the system enters 

e 

a constrained subarc with <j> = 0; similarly t^ denotes the time of 
leaving one with $ = 1. We further assume that 



$(t) = 0 for t° £ t ^ T. 

e 



( 30 ) 



We now prove that it is impossible to have v(t=t°) = 0; in fact, 



v(t) must be 


< 0 


before <j> = 0. We begin by observing that 


<Kt=t°) = o 

e 


with 0 < <t>(t) 


< 1 


for 


t° - 8 < t < t° where 6 > 0. Hence, 
e e 


c 

* 

rt 

II 

1 


and p^(t) ^ 0 


by 


(17) 


for t° - 6., < t < t° where 0 < 5., £ 
e 1 e 1 


6 . Con- 



sidering a Taylor series expansion about t = t° and (27) , (65) , and (66) , 

we have for t £ t° 

e 



/ ° \ 
(t -t) 

/\ / o \ / 0 \/ 0 \. e 

p.(t) = (t -t)y(t )v(t ) + x — 

4 e e e Z 



where t 6 (t,t°). Considering that t 
easily seen that v(t°) ^ 0 =* p^(t) > 0 
by the above. Hence, 



{ (b 1 x 1 +b 2 x 2 )v('t) - y(t) — (”t)}, 

< t°, (64), and (81), it is 

for t < t° , which is impossible 
e 



(8 



v(t=t°) < 0. 
e 



( 82 ) 
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Now, it is readily shown that for t* £ t £ t^, p^(t) = [ y(s)v(s)ds 

so that recalling (30) 



p^(t^ + ) = | y(s)v(s)ds. 



(83) 



Then, the continuity of v(t) , (64) and (82) yield that v(tj^) > 0. Denote 
the time at which v = 0 as T - x- so that v(t=T-x ) = 0. Then since 

ft 1 1 



we also have p^(t) = 



^y(s)v(s)ds, it follows that p^(t) < 0 for 



t* < t < t° and hence by (17) 



u*(t) = -R^ for t* < t < t°. 

It follows that t° - t* = 1/R^. The times t* and t° are determined 
by the conditions 



T - T 1 € (t*,t°) where v(t=T-t 1 ) = 0, 



(84) 



and 



y(s)v(s)ds = 0, 



(85) 



which may be written as 

T-t, 



o 

t 

re 



y(t)v(t)dt = - 



y(t)v(t)dt. 



T-x, 



( 86 ) 



SL ‘1 

The relationship of the times tj , T-x., , and t° to the time histories 

a 1 e 

of <}>(t) and v(t) are shown in Figure 1. 
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Omitting further details, we reach an important conclusion: for the 

"inertial" combat (2) one begins to redistribute fire earlier in forward 
time (anticipating changes in target priority) than in the corresponding 
"inertialless" case (1) , Again, the reader is referred to Figure 1 for 
motivation of this statement. 

The above considerations on tracing extremals backwards from the 

various terminal states of battle are summarized in Table I. This table 

shows from which of the end states extremals lead. An entry like "possible 

(but not too likely)" or "possible (but unlikely)" means that the domain 

£ 

of controllability for extremals to that end state is a "rather small" 

*By the domain of controllability for extremals to an end state we mean the 
set of points in the initial state space from which extremals lead to the 
terminal state. 
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Table I. Possible End States of Battle for 
Extremals when x^(T) ^^(T) > 0. 





a l P * a 2 q 


a L P < a 2 q 


4> (T) = 0 






(on constrained subarc 


impossible 


possible 


for finite time interval) 


(and quite likely) 


d> (T) = 0 


impossible 


possible 


(but 4> ( t) >0 for T-6<t<T) 




(but unlikely) 


0 < <t>(T) <1 


possible 

(but not too likely) 


possible 

(but not too likely) 


4> (T) =1 

(on constrained subarc 


possible 


impossible 


for finite time interval) 


(and quite likely) 




<KT) = 1 


possible 


impossible 


(but 4>(t)<l for T-6<t<T) 


(but unlikely) 
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subset of the initial state space in contrast to an entry like ’’possible 
(and quite likely) 1 ’ for which the corresponding set of initial conditions 
is ’’much larger . ” It has not been possible to develop explicit formulas 
for these domains of controllability (as done, for example, in [14], [19]) 



6 . Synthesis of Extremal Policy when x ^ (T) = 0 and x ^ (T) > 0 . 
There are two cases to consider: 



Case (a) on constrained subarc for finite interval of time, 
Case (b) x^(t) > 0 for t < T. 



For Case (a) : x ^(t) = 0 for t ^ ^ t ^ T with t ^ < T , it is 
clear that we must have <f)(t) = 0 for t^ £ t £ T so that integration 
of the adjoint equations (13) through (15) and (63) with the boundary 
conditions (16) yields that for t^ £ t £ T we have 



P 2 (t) = -q cosh/a^b^ (T-t) , 



P 3 (t) = qy — sinh ^ a 2 'b' 2 ' (T-t), 



(87) 

( 88 ) 



v(t) = v(T) + (a,b -a b ){cosh I'a b (T-t) - 1} with v(T) £ 0, 

1 1 z z z z 

the requirement that v(T) ^ 0 being a consequence of (43) . We also have 
by (40) and (62) that 



v(t) =* U (T) + — 

a i 



v(s)ds with u(T) ^ 0. 



( 90 ) 



Recalling our nonrestrictive assumption that > £ 2 —? anc * (39) 9 

consideration of (89) yields that 



(89) 
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P 4 (t) > 0 


for 


t-. £ t < T with p.(t=T) = 0 
1 4 


and 


< 

/~\ 

rr 

V-/ 

V 

o 


for 


t 1 £ t < T with v(t=T) ^ 0. 



Hence, we see that the necessary conditions of optimality (42) through 
(44) are always satisfied on the constrained subarc. Thus, there are no 
restrictions on when such an extremal can occur. 

We next show that we must have v(t^) < 0. The proof is by con- 
tradiction. First, we observe that x^(t) > 0 for t < t^ and x^(t=t 1 ) » 0 
imply that 



u*(t) = -R^ for t^ - 6 < t < t^ where 6 > 0. (91) 



If we had v(t^) £ 0, then (63) and the fact that p^(t) > 0 for t < T 

(which follows from (88)) yield that v(t) > 0 for t < t . This yields 

p^(t) > 0 for t < t^ by (65) and the condition (51) that p^(t^) 55 

However, (91) is impossible if p^(t) > 0. 

It should be noted that v(t^) < 0 and v(t*) > 0 guarantee that 

* 

Vq > 0, since from (47), (48), and (62) it follows that 



v 



0 



— {v(tb - v(t j> . 
a^ 1 1 



It should be recalled that is chosen so that = 0 when x^(t^) “ 0. 

The value of Vq depends on x°, x°, , 4 >q, and T^. The multiplier 

is unrestricted and is chosen so that v(T) = (a^p-a^q) - a^v^ ^ 0* 

Considering the corner conditions (47) , (48) , and (51) , we have shown 

that 



This condition was shown to be necessary for optimality in Section 4.2. 
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P 1 (t 1 ) < o,* P 2 (t 1 ) < 0, P 3 (t~) > 0, P 4 (t 1 ) = 0, v(c x ) < 0. (92) 

Further details for 0 £ t < t^ in the backwards synthesis are similar 
to those given for Case (b) in Section 5 and are therefore omitted. Some 
possibilities for the synthesized extremal control are shown in Table II. 
Other variations in the form of the synthesized extremal control are 
possible, and the reader should have no difficulty in identifying them. 



For Case (b) : x ^(t) > 0 for t < T , there are three subcases to 

be considered: 

Subcase (bl) 4>(t=T) = 0, 

Subcase (b2) 0 < 4>(t=T) < 1, 

Subcase (b 3 ) <j>(t=T) = 1. 

Further analysis yields the results shown in Table III. This shows from 
which of the end states extremals lead. We will now sketch how these 
results were obtained. 

Subcase (bl) : <ft(t=T) = 0 . 

Since <J>(t=T) = 0, (16) yields that p^(x=0) = ^ 0. It is clear 

that we must have <f>(t) > 0 for t £ (T-6,T) for some 6 > 0. Hence, 
by (17) and (20) 



u*(t) = -R^ for t 6 (T-6 1 ,T) c (T-6,T) . (93) 

We now prove that P^(t=0) = 0. The proof is by contradiction. If 

Pz ( t= 0) > 0, then p, (t) > 0 for t € (T-6~,T) and this contradicts (93) 
*4 4 Z 



To establish this result one makes the identification p^(t) = - x , t y > 
where W denotes Bellman r s optimal value function. It follows 1 
that p- (t) < 0, since addition of more X at t cannot help but reduce 
Y ! s optimal return. A justification of this argument is given in [18]. 
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Table II. 


Some Possibilities for 
Collateral Information 
Constrained Subarc for 
(Must Have v(T) ^ 0) . 


Synthesized 
for Case in 
Finite Time 


Extremal Controls 
which (T) = 0 

Interval; and x^ 


and 

(On 

(X) > 0 


Case (1) • 


time, t 


2*CO 


v&L 


U*(t) 




X- L (t) 


t=0 


>0 


>0 


*U 


0^4><i 


>0 


Octet 1 

e 


>0 


>0 


*u 


0<4>< 1 


>0 


t»t 1- 

e 


0 


>0 


*U 


1 


>0 


t =t 1+ 

e 


>0 


>0 


0 


1 


>0 


t 1 < t< t » 


>0 


>0 


0 


1 


>0 


e t 


t-t 1 " 


0 


>0 


0 


1 


>0 


1+ 

tat i 


0 


>0 


_r l 


1 


>0 


t^CtCT-ti 


<0 


>0 




0«(><1 


>0 


t=5 T-T 1 


<0 


0 


_r l 


0<4>< 1 


>0 


X-T 1 <t<t 1 


<0 


<0 


-*L 


0<4><1 


>0 


t=t i 


0 


<0 




0 


0 


t = t t 


>0 


>0 


0 


0 


0 


t L <t<X 


>0 


>0 


0 


0 


0 


t=X 


0 


>0 


0 


0 


0 


Case (2) • 


time , t 


E*<0 


v ( t) 


U*(t) 


MO 




t=0 


>0 


>0 


*u 


O^jxl 


>0 


0< t< t 

s 


>0 


>0 




0<$<1 


>0 


t=t 

1 

t=t 

s 


0 


>0 


*u 


0<4><L 


>0 


0 


>0 


-*4. 


0<4><l 


>0 


V t<T - T i 


<0 


>0 


-*L 


0<4><1 


>0 


'-T-'l 


<0 


0 


-*L 


0<(J><1 


>0 


X-T^<t<t^ 


<0 


<0 


-»L 


0<4> <1 


>0 


t=t i 


0 


<0 


_r l 


0 


0 


t=t l 


>0 


>0 


0 


0 


0 


t^<t<X 


>0 


>0 


0 


0 


0 


t=X 


0 


2>0 


0 


0 


0 


Case (3) . 


time , t 


£ 4 ( t ) 


V<t) 


u*(t) 


MO 


x 1 (t) 


t-0 


£0 


>0 


_r l 


0<<j>£l 


>0 


0 <t <T-t^ 


<0 


>0 


-“l 


0<<jxl 


>0 


t=x-ti 


<0 


0 


-*L 


0<4><1 


>0 


X-T 1 <t<t 1 


<0 


<0 


-“l 


0<$<1 


>0 


t-t£ 


0 


<0 


-r l 


0 


0 




>0 


>0 


0 


0 


0 


t 1 <t<T 


>0 


>0 


0 


0 


0 


t=T 


0 


^0 


0 


0 


0 
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Table III. Possible End States of Battle for 

Extremals when x^(T) = 0 (But x^(t) > 0 
for t < T) and x 2 (T) > 0. 





a l P * a 2 q 


a l p < a 2 q 


4>(T) » 0 

(on constrained subarc 
for finite time interval) 


impossible 


impossible 


<KT) =0 

(but 4> ( t) >0 for T-<S<t<T) 


possible 

(but very unlikely) 


possible 

(but very unlikely) 


o < <kt) < i 


possible 

(but not too likely) 


possible 

(but not too likely) 


+ (T) =1 

(on constrained subarc 
for finite time interval) 


possible 

(and quite likely) 


impossible 


4>CT) = 1 

(but 4> ( t ) < 1 for T-6<t<T) 


possible 
(but unlikely) 


impossible 
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by (17). Furthermore, we must have v(T) < 0. Again, the proof is by 

3 W 

contradiction. First, we remark that by considering p^(t) = - — r— r (see 

J dy (t; 

[18] for further discussion) we obtain that P 3 ( t ) > 0 for t < T. Thus, 

by (63) we have 4^ ( T ) > 0 for t > 0. If we had v(T) ^ 0, then we 
dT 

would have v(t) > 0 for t < T, and hence p^(t) > 0 for t € (T-d^jT) 

via (65) and p^(T) ■ 0. However, by (17) this contradicts (93). 

By (16) we see that v(T) < 0 =» — (a 1 p-a„q) < v 1 £ p, the latter 

a^ j. z x 

inequality a consequence of requiring p^(T) :£ 0. It is clear that 
can always be chosen to satisfy the above conditions regardless of whether 
a p £ a 2 q or a^p < a 2 q. > this subcase is always possible for the 

appropriate initial values of the state variables . Furthermore, we have 

Pl (T) £ 0, p 2 (T) < 0, P;J (T) - 0, p 4 (T) = 0, v(T) < 0, 

so that the synthesized extremal control function can take any of the forms 
shown in Table III for 0 £ t £ t^, the realization of any particular 
form being dependent upon the state variable initial conditions. 

Subcase (b2) : 0 < <j>(t=T) < ! • 

Since 0 < <£(t=T) < 1, (16) yields that p 4 (t=0) = 0. Moreover, 
for 0 < 4>(T) ^ 1 the transversality condition H(T) =0 no longer holds, 
since when x^(T) = 0 ( = y(T)), variations in control 5u cannot increase 

T because this would lead to violation of the constraint x^ ^ 0 if the 
planning horizon were extended to T + dT with x^(T) = 0 and <f>(T) > 0. 
Then, a one-sided version of the usual variational argument [2] yields 
(after dropping some terms) dJ = H(T)dT £ 0 with dT £ 0 , where J 
denotes the (augmented) return functional. This yields H(T) ^ 0 and 
consequently p^(T) £ 0. Again using the argument which considers 



33 



p 3 ( T ) = 3 y ^ T ) [18], it follows that p 3 (T) = 0 (and also p 3 (t) > 0 

for t < T) . 

Now, v(t=0) = (a^p-a^q) - a^v^ where 0 £ £ p , the latter 

inequality being a consequence of requiring p^(T) ^ 0* For a^p ^ a^q, 

we can have v(t=0) either ^ 0 or <0. When v(t=0) ^ 0, the resulting 

synthesized extremal control takes a form similar to that shown in Table IV 

* 

below. When v(t=0) <0, it takes any of the forms shown in Table II 
for 0 £ t t^ . In all cases is chosen so that x^(T) = 0. The 

realization of the synthesized extremal control depends upon the state 
variable initial conditions. 

Subcase (b 3 ) : <f> ( t =T ) = 1 . 

Since <£(t=T) = 1, (16) now yields p^(i=0) = £ 0. If $(t) < 1 

for T - 6 £ t < T where 6 > 0, then previous arguments (i.e. proof by 
contradiction) yield that p^(x=0) = 0. Next, we show that we must have 
v(t=T) ^ 0. The proof is by contradiction. First, we observe that 
<j>(t) < 1 for T - 6 £ t < T implies that 

u*(t) = for t € (T-6 X ,T) c (T-6 ,T) . (94) 

If v(T) < 0, then v(t) < 0 for t € (T-6^,T). Considering a Taylor 
series expansion about t = T, we obtain p^(t) = (T-t)y(t) v(t) where 
t 6 (t,T), since p^(T) = 0. But then t £ (T-6 ? ,T) fl (T-6^,T) yields 
p^(t) < 0 and this contradicts (94) by (17). Since v(T) = (a^p-a^q) - 
a^v^ ^ 0 where ^ 0, it follows that this subcase with 4>(t) < 1 

for t < T is only possible when a^p ^ a 2 ^* 

If we are on a constrained subarc with <J>(t) = 1 for t* £ t £ T , 
then (23) and (25) must hold. The former yields p 4 (x=0) = 0, while the 

This is the only case possible when a^p < a^q. 
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latter yields 0 £ £ — (a^p-a^q) so that this subcase with <j>(t) = 1 

for t^ £ t £ T is only possible when a^p ^ a 2^ # T ^ e ^ urt ^ er synthesis 
of the extremal control now follows previous arguments. The synthesized 
extremal control is shown in Table IV. 



Table IV. Synthesized Extremal Control and Collateral Information 





for Case in 


which x^(T) = 


0 (But x 1 (t) 


> 0 for 


t < T) 




and x 2 (T) > 


0 with 4> ( t) 


= 1 for t 1 ^ 
e 


t £ T. 




time, t 


£^(t) 


v(t) 


U*(t) 


t(t) 




t=0 


>0 


>0 


“u 


0^4><1 


>0 


Octet 1 

e 


>0 


>0 




0<<t><l 


>0 


t-t 1 ' 

e 


0 


>0 


0 


1 


>0 


t=t 1+ 

e 


>0 


>0 


0 


1 


>0 


t^tcT 

e 


>0 


>0 


0 


1 


>0 


t=T 


0 




0 


1 


0 


Note: This 


case is only 


possible when 


a 1 p ;> a 2 q. 







7. Synthesis of Extremal Policy when x^(T) > 2 aru * (T) = 0 . 

When x^CT) =0, we must have x^Ct) > 0 for t < T, since it 

is nonoptimal (see Section 4.2 above) to be on a constrained subarc with 
x^ = 0 for a finite interval of time. There are then three cases to be 
considered : 

Case (a) $ ( t=T) = 0 , 

Case (b) 0 < cf>(t=T) < 1, 



Case (c) 4> ( t=T ) = 1. 
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Further analysis now yields the results shown in Table V. This shows 
from which of the end states extremals lead. The symmetry of these results 
(interchange x^ and x^) in relation to those shown in Table III for 
x^(T) = 0 but x^(t) > 0 for t < T should be noted. We will now sketch 
how these results were obtained. 

For Case (a) : j)(t=T) = 0 , (16) yields that p^(i=0) = ^ 0 . 

Observing that <{>(t) > 0 for t 6 (T-6,T) again implies (93), it follows 

dp 4 

by previous arguments that p^(x=0) = 0. Also, (93) implies that (t) < 0 

for t € (0,6^) c: (0,6^), and hence v(x=0) < 0 by (65). Observing that 

v(t= £ 0) - (a^p-a^q) + f°H° ws this case with <j>(t) > 0 for 

t < T is only possible for a^p < a^q. The synthesized extremal control 

takes the form shown in Table II for 0 £ t £ t^. 

If we are on a constrained subarc with <£(t) = 0 for t° £ t £ T, 

then (33) and (34) must hold, whence it follows that p^(x=0) = 0 and 

v (i=0) = (a p-a q) + a v £ 0 or 0 g v £ — (a q-a p) . Then p (t=0) = 
12 22 Z Z 1 2 

-q + V < 0 so that we must have v(T) < 0, since for t° £ t £ T we 

(q-'O 

have v(t) = v(T) + (a.b ~a 0 b ) — (cosh /a 0 b 0 (T-t) - 1} and v(t) £ 0 

1 1 2 2 b 9 2 2 

Z 1 

by (34) and (62). Thus, 0 £ v < — (a q-a p) so that this case with 

2 a^ 2 x 

4>(t) = 0 for t° £ t :£ T is only possible when a^p < a^q. The synthesized 
extremal control is shown in Table VI. 

For Case (b) : 0 < 4>(t=T) < 1 , (16) yields that p 4 (t= 0) =0. If 

a^p £ a^q, then v(T) ^ 0. The requirement that P 2 (T) £ 0 yields 

dv 

0 :£ q so that by (14) we have p^(t) > 0 for t < T, whence — (x) 

> 0 for all t > 0. Hence, v(x) > 0 for t > 0, and the synthesized 
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Table V. Possible End States of Battle for 

Extremals when x^T) > 0 and x 2 (T) » 0 
(But x 2 (t) > 0 for t < T) . 



a l P k a 2 q 



a x P < a 2 q 



♦ CT) - 0 

(on constrained subarc 
for finite time interval) 


impossible 


possible 

(and quite likely) 


<t>(T) =0 


impossible 


possible 


(but $ (t)>0 for T-<5<t<T) 




(but unlikely) 


n < a f t'i < i 


possible 


possible 




(but not too likely) 


(but not too likely) 


< — 1 
It 

✓— N 

H 

-e- 






(on constrained subarc 


impossible 


impossible 


for finite time interval) 






<KT) =1 


possible 


possible 


(but $(t)<l for T-d<t<T) 


(but very unlikely) 


(but very unlikely) 
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Table VI. One Possibility for Synthesized Extremal Control and 
Collateral Information for Case in which x^(T) > 0 





and x 


2 (T) = 0 (but 


x^(t) > 0 for 


t < T) With 






+(o = 


0 for t° £ t 
e 


s: T. 






time , t 




v(t) 


u*(t) 




x 9 (t; 


t=0 


>0 


>0 


*U 


0^4><i 


>0 


0<t< t 1 
e 


>0 


>0 




0<4><1 


>0 


t=t l_ 

e 


0 


>0 




1 


>0 


t=t l+ 

e 


>0 


>0 


0 


1 


>0 


e l 


>0 


>0 


0 


l 


>0 




0 


>0 


0 


1 


>0 




0 


>0 


-*L 


l 


>0 




<0 


>0 




0«J><1 


>0 


t=T-x 


<0 


0 


_r l 


0<4*< 1 


>0 


T-t <t<t° 
1 e 


<0 


<0 


-®L 


0<4><1 


>0 


o- 

t=t 

e 


0 


<0 


-*L 


0 


>0 


o*f 

t=t 

e 


<0 


<0 


0 


0 


>0 


o 

t < t<T 
e 


<0 


<0 


0 


0 


>0 


t=T 


0 


<0 


0 


0 


0 


Notes: (1) 


This case 


is only possible when a^p < 


a 2 q. 





(2) Variations in the synthesized extremal control analogous 



to those shown in Table II are possible. 
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control is as shown in Table IV for 0 £ t £ t^. If a^p < a^q , then we 
can have either v(T) ^ 0 or v(T) < 0. The former case has just been 
discussed, while the latter case yields a synthesized extremal control like 
one of those shown in Table II for 0 £ t £ t^ . 

For Case (c) : <j> ( t=T) = 1 , (16) yields that p^(x=0) = £ 0. 

It is clear that we must have <f>(0 < 1 for t 6 (T-<$,T), since (t) > 0 
for t < T and x^CT) = 0. Previous arguments readily yield p^(x=0) = 0 
and 0 £ v 2 £ q (since p 2 (T) ^ 0) * If a^p :> a^q, then v(T) ^ 0 and 
further results are similar to those of the previous case. If a^p < a 2 q, 
then v(T) is either ^ 0 or <0. The case in which v(T) ^ 0 has 

just been discussed. If v(T) < 0, this leads to p^(t) < 0 for 

t 6 (T-fi 2 ,T) . By (17) this is impossible, however, since we must have (94) 
hold, since <Kt) < 1 for t 6 (T-6,T). Thus, this case is always possible 
for the appropriate initial values of the state variables, although we must 
always have v(T) ^ 0. The synthesized extremal control is the same as for 
Case (b) with v(T) ^ 0. 

8 . Determination of the Optimal Fire Distribution Policy . 

It remains to discuss how the optimal fire distribution policy may 
be determined from among the extremal control policies developed in the 
previous sections. Two ways of proving the optimality of an extremal 
trajectory are as follows (see [17], [18]): 

(a) show that sufficient conditions of optimality are satisfied on 
the extremal, 

(b) by citing the appropriate existence theorem, show that an optimal 
control exists for the problem at hand; there are two further 
subcases: (1) if the extremal is unique, then it is optimal or 
(2) if the extremal is not unique and only a finite number exist, 
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then the optimal trajectory is determined by considering the 
finite number of alternatives. 

In the first case there are both local and global sufficiency theorems 

to be considered. Neither, however, is convenient to apply to the problem 

(2) at hand. The usual control theory sufficient conditions for a local 

maximum (see pp. 181-184 of [2]) are not satisfied, since the problem (2) 

3 * 

is singular (in the sense that -r — 2 = 0) • Sufficient conditions for a 

du 

global maximum depend upon the appropriate functions being concave and 
the planning horizon being fixed in length [4], [9]. The latter condition 
is not satisfied for (2), since, for example, the battle can end by Y 
being annihilated at any time before T^. 

Thus, as in earlier papers [14], [18], [19], we are again led back 
to the enumeration of all extremals in various regions of the state space 
(i.e. the intersections of the domains of controllability for extremals 
leading to the various terminal states of battle (see [14], [19])) and 
comparison of corresponding returns. The author has not been able to develop 
analytic, closed-form results for the integration of the state equations (2) 
in the general case (much less for the determination of domains of con- 
trollability or computation of extremal returns). Thus, it has not been 
possible to analytically determine the optimal control from among the candi- 
date extremal controls for the problem at hand as was done in [14] (see 
also [19 ] ) . 

Moreover, the existence of an optimal control readily follows from 
the control variable u being uniformly bounded (see Corollary 2 on p. 262 



Recently, conditions sufficient for a local maximum have been developed for 
the singular control problem. These are, however, essentially impossible 
to check for the problem at hand (see [18]). 

For given initial and parameter values this may be done numerically by 
following the steps outlined in [14] (or [15]). 
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of [8])* The uniform boundedness of responses is a consequence of u 
being required to lie in a compact set (i.e. < -R^ £ u £ < +°°. 

Let us observe that for -R^ = and R^ = +» problem (2) is equivalent 

to problem (1) , the "inertialless" combat problem. In this case there 
will be jumps In state variable <j> in problem (2). Additional consid- 
erations are now required in the development of necessary conditions of 
optimality (see [12], [23], [24]). It should be pointed out, moreover, 
that the existence theorems of Lee and Markus [8] (and others) only apply 
for admissible trajectories which are absolutely continuous. Hence, they 
no longer can be invoked to insure the existence of an optimal control when 
— R^ — — <» and R^ ~ +o° . 



9 . More General Redistribution of Fire Model . 

In the model (2) considered above the rate of redistribution of 
fire by Y firers was assumed to be bounded, i.e. -R^ ^ 
not to be dependent upon the distribution of fire. It is of considerable 
interest and importance for us to continue our investigation of the 
dependence of the structure of the optimal fire distribution policy upon 
model form (see [16]). The above simple model (2) for redistribution of 
fire is equivalent to 

^-Fi(u), (95) 

where (a) F^(u=0) - 0 and (b) F^(u) is a concave function for u £ [-R^,R^] 

with F|(u=R L ) > 0. 

Let us consider the more general case in which the rate of redistri- 
bution of Y-fire is bounded and also dependent upon the distribution of 
fire. Thus, we consider the model 

‘See also [17] , [18] . 



d(fc 

dt 






and 



where F(<J>,u) reflects the ability of Y firers to redistribute their 
fire over X-target types. We will see below that the functional form of 
F(<j>,u) (as long as it is not pathological) does not affect the structure 
of the optimal fire distribution policy, although the time history of the 
distribution of Y-fire, i.e. 4>(t) for 0 £ t £ T , does vary, of course, 
with changes in F(<J>,u). 

What are appropriate properties to postulate for F (0 , u) in order 
to model the real world? First of all, it must be possible to keep the 
distribution of fire constant. Thus, we stipulate that if u = 0, there 
is no redistribution of fire. Also, if u > 0 (<0), then <f> increases 

(decreases) (but the rate of change is bounded) . It seems appropriate to 
postulate that there are "s tragglers" in redistributing their fire, as 
<J> approaches zero (or one) its rate of change decreases. Finally, we 
assume that all the Y firers can shift their fire in a finite interval 
of time. 

We therefore make the following assumptions about F(<j>,u) : 

(Al) for all <p £ [0,1] 

i -F £ F(<f),u) < 0 for -fL £ u < 0 , 

L F(<j>,u) =0 for u = 0, 

0 < F(<}>,u) ^ F for 0 < u £ FL 

where F , F > 0, 

L U 

(A2) F(<p,u) is piecewise in its arguments for all 

<t> € [0,1] and u 6 t-VV’ 
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(A3) for fixed 4> € [0,1], F(<j>,u) is concave in u for 

—R. £ u £ R.. with 3F / , _ N ^ rt 

"l *u ^ (♦.u-y > o, 



(A4) F(<|>,u) is a strictly | decreasing } ^ unct ^ on °f $ for 



fixed u such that / ^ U < ® l . 

(0 < u <; Ry/ 



An example of such a function is 



F(*,u) = 



u(e^+<j>) for -R^ £ u < 0, 

uCl+e^"^) for 0 s; u <; Ry, 
where e ^> e 2 > 



(97) 



Then we have, for example, <j>(t) = -e^ + (e^+<j>Q)e 
-R^ for tg £ s £ t. 



when u(s) = 



We now show that for problem (2) when (96) is used with F(<j>,u) 

satisfying (Al) through (A4) , the structure of the optimal fire distribu- 

* d6 

tion policy is the same as for problem (2) (with -pjj- =* u) . The Hamiltonian 
in this case is given by 

H(t,x i ,p i ,u) = — p 1 4>a. 1 y - P 2 0-~ «(> ) a 2 y “ P 3^ b l X l +b 2 X 2^ + P 4 F ^> U ) 

- n 1 (t)u + n 2 (t)u - u(t){F(<J> .u^y - i))a 1 (b 1 x 1 +b 2 x 2 ) } , (98) 



so that (15) is now replaced by 



dp 



d r " W ‘ p 2 a 2 y - p 4 If + p(t) {!* V • a l^ b l x l +b 2 x 2 ) } ■ <99) 



Of course, 4>(t) and dependent quantities differ in their particular form. 
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When x^,X 2 »y > 0 and 0 < <J> < 1, the extremal control is again 

given by (17) because of assumption (A3) . Again, it will be shown that 

there is no singular solution. This is proved as follows. Since r 5 - t 0 , 

3u 

3H 

the condition - 0 for a finite interval of time again leads to (18) 

and *~(|^} = 0 on the singular subarc so that (19) again follows. For 

an arc on which (18) and (19) hold we again have (20) so that -r— / 0, 

dt^du' 

since it has been shown that p^(t) > 0 for t < T, and we have assumed 
a^b^ > a 2^2* follows then that there is no singular solution (i.e. no 

singular subarc in the solution) . 

The analysis of constrained subarcs also follows that given above 
for problem (2) . We illustrate this for a constrained subarc on which 
x^(t) = 0 for t^ £ t £ T. Again, (36) and (37) hold. Now 

1Z ■ 1Z (P4-" a l y) ' 

3H 3F 

so that — = 0 again yields (38), since — ^ 0. It is readily shown 
du du 

in a similar fashion that (40) and (41) still hold so that the necessary 
conditions of optimality on a constrained subarc with x^(t) = 0 for a 
finite interval of time are again given by (42) through (44). Treatment 
of other types of constrained subarcs is similar and further discussion 
is omitted. 

Thus, we have shown that the characterization of an optimal fire 
distribution policy for the more general redistribution of fire model 
given above is exactly the same as that for problem (2) . 

10 . Discussion . 

In this section we discuss what we have learned about the influence 
of command and control limitations on the structure of optimal fire 
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distribution policies. The reader should bear in mind, moreover, that 
the combat model considered in this paper is, far too simple to have the 
results obtained from it be taken literally, but he should interpret these 
results as indicating general principles or hypotheses to be further 
investigated by higher resolution analysis techniques such as field experi- 
mentation or computer simulation. Nevertheless, it is hoped that our simple 
model (2) has provided some insights into the optimization of this combat 
process. Thus, it is the form of the optimal policy and its functional 
dependence on model parameters that is of primary interest. 

In this paper we have considered a version of the two-on-one fire 
distribution problem considered elsewhere [14], [18]. In the version (2) 
considered here there are limits on the rate at which the distribution of 
fire can change. This might be thought of as reflecting a command and 
control limitation (e.g. the existence of a time lag between the giving 
of an order and its implementation) . From our above study, we conclude 
that the structure of the optimal fire distribution policy for problem (2) 

(a) depends upon the model for the attrition of X-force target types and 
not upon the nature of the model for redistribution of fire (see Section 9) 
(as long as this does not change the functional form of enemy target-type 
attrition) ; (b) depends upon the following model parameters (see [16] for 
further discussion): (1) f° r ^ = (2) a^p and a 2 ^> anc * (3) 

whether Y wins or loses; and (c) is very similar to that for the inertia- 
less combat case. 

To elaborate further about (c) , the reader can find results for the 
inertialess combat problem (1) reported in [14], [18]. When these are 
compared with those for the problem at hand (2) , there are seen to be many 
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similarities (see below) . Both models have the same attrition structure 
(for X-force target types) , although the model (2) considered here incor- 
porates command and control limitations. As long as these do not affect 
the function form of the attrition process of enemy target types, the 
optimal policies are similar (see also Section 9) . 

We saw that for both "inertialess" and "inertial" combat, we must 
necessarily have 

a l b l * a 2 b 2' 

in order for it to be optimal to drive x^ to zero (while > 0 and 

before t = T) . Furthermore, we also developed a necessary condition 
involving v(t) = a^-p^t)) - a^(-p 2 (t)) for it to be optimal in (2) to 
have <j>(t) = 0 for 1 for a finite interval of time. Again, the results 
were similar to those for the "inertialess" combat problem (1) . 

The sign of the quantity v(t) = a^(-p^(t)) - reflects 

the ranking of target priorities: when v(t) >0, is a higher 

priority target for Y than is X^ ; and the situation is reversed when 
v(t) <0. A significant result (see Section 5) was that for "inertial" 
combat an optimal policy for the distribution of Y-fire over enemy target 
types is characterized by beginning to change the distribution of fire 
(i.e. shifting of fires) before target priorities (as measured by the sign 

In [13] Schreiber formulates a Lanches ter-type combat model in which the 
effectiveness of intelligence and command and control systems modifies the 
form of combatant attrition. These capabilities are incorporated into 
Schreiber ! s model through a parameter £ [0,1] which he denotes as "command 
efficiency." His equations reduce to Lanchester’s classical equations for 
area fire when "command efficiency" is equal to zero for each side and to 
those for aimed fire when it is equal to one. 
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of v(t)) change. (It should be recalled that in the "inertialess" version 
of this problem all fire was concentrated on the target type with the 
higher priority and instantaneously shifted when this changed.) In other 
words, due to decreased reaction ability one begins to change the distri- 
bution of fire before target priorities change in anticipation of this coming 
change. We might even generalize this result to say that with slow reactions 
one's optimal policy involves anticipating enemy actions. 

It should finally be pointed out that to the best of the author's 
knowledge the problem (2) considered in this paper is the first one with 

a higher order (i.e. greater than first order) SVIC (see [3], [6], [22]) 

* 

to be considered in the operations research literature. Moreover, the 
complete treatment of this problem was made possible by some recent results 
of the author [21], [22]. 



Examples of problems with an SVIC of order greater than one have appeared, 
of course, previously in the engineering literature (see [3], [6]). 
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